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Abstract. The homomorphism problem for relational structures is an abstract way of formulating 
constraint satisfaction problems (CSP) and various problems in database theory. The decision version 
of the homomorphism problem received a lot of attention in literature; in particular, the way the 
graph-theoretical structure of the variables and constraints influences the complexity of the problem 
is intensively studied. Here we study the problem of enumerating all the solutions with polynomial 
delay from a similar point of view. It turns out that the enumeration problem behaves very differently 
from the decision version. We give evidence that it is unlikely that a characterization result similar to 
the decision version can be obtained. Nevertheless, we show nontrivial cases where enumeration can 
be done with polynomial delay. 



1. Introduction 

Constraint satisfaction problems (CSP) form a rich class of algorithmic problems with applica- 
tions in many areas of computer science. We only mention database systems, where CSPs appear 
in the guise of the conjunctive query containment problem and the closely related problem of eval- 
uating conjunctive queries. It has been observed by Feder and Vardi |[T4]| that as abstract problems, 
CSPs are homomorphism problems for relational structures. Algorithms for and the complexity 
of constraint satisfaction problems have been intensely studied (e.g. Ii20l [TOl |4l |5l), not only for 
the standard decision problems but also optimization versions (e.g. ||3l HH |23l H^l) and counting 
versions (e.g. El El El [H) of CSPs. 

In this paper we study the CSP enumeration problem, that is, problem of computing all solutions 
for a given CSP instance. More specifically, we are interested in the question which structural 
restrictions on CSP instances guarantee tractable enumeration problems. "Structural restrictions" 
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are restrictions on the structure induced by the constraints on the variables. Example of structural 
restrictions is "every variable occurs in at most 5 constraints" or "the constraints form an acyclic 
hypergraphQ" This can most easily be made precise if we view CSPs as homomorphism problems: 
Given two relational structures A, B, decide if there is a homomorphism from A to B. Here the 
elements of the structure A correspond to the variables of the CSP and the elements of the structure 
B correspond to the possible values. Structural restrictions ai^e restrictions on the structure A. If A is 
a class of structures, then CSP(^, — ) denotes the restriction of the general CSP (or homomorphism 
problem) where the "left hand side" input structure A is taken from the class A. ECSP(^, — ) 
denotes the corresponding enumeration problem: Given two relational structures A G ^ and B, 
compute the set of all homomorphisms from A to B. The enumeration problem is of particular 
interest in the database context, where we are usually not only interested in the question of whether 
the answer to a query is nonempty, but want to compute all tuples in the answer. We will also briefly 
discuss the corresponding search problem: Find a solution if one exists, denoted SCSP(^, — ). 

It has been shown in [2] that ECSP(^, — ) can be solved in polynomial time if and only if the 
number of solutions (that is, homomorphisms) for all instances is polynomially bounded in terms 
of the input size and that this is the case if and only if the structures in the class A have bounded 
fractional edge cover number. However, usually we cannot expect the number of solutions to be 
polynomial. In this case, we may ask which conditions on A guarantee that ECSP(^, — ) has a 
polynomial delay algorithm. A polynomial delay algorithm for an enumeration problem is required 
to produce the first solution in polynomial time and then iteratively compute all solutions (each 
solution only once), leaving only polynomial time between two successive solutions. In particular, 
this guarantees that the algorithms computes all solutions in polynomial total time, that is, in time 
polynomial in the input size plus output size. 

It is easy to see that ECSP(^, — ) has a polynomial delay algorithm if the class A has bounded 
tree width. It is also easy to see that there are classes A of unbounded tree width such that 
ECSP(^, — ) has a polynomial delay algorithm. It follows from our results that examples of such 
classes are the class of all grids or the class of all complete graphs with a loop on every vertex. It 
is known that the decision problem CSP(^, — ) is in polynomial time if and only if the cores of the 
structures in A have bounded tree width [17] (provided the arity of the constraints is bounded, and 
under some reasonable complexity theoretic assumptions). A core of a relational structure ^ is a 
minimal substructure A' A such that there is a homomorphism from ^ to minimality is with 
respect to inclusion. It is easy to see that all cores of a structure are isomorphic. Hence we usually 
speak of "the" core of a structure. Note that the core of a grid (and of any other bipartite graph with 
at least one edge) is a single edge, and the core of a complete graph with all loops present (and of 
any other graph with a loop) is a single vertex with a loop on it. The core of a complete graph with 
no loops is the graph itself. As a polynomial delay algorithm for an enumeration algorithms yields 
a polynomial time algorithm for the corresponding decision problem, it follows that ECSP(^, — ) 
can only have a polynomial delay algorithm if the cores of the structures in A have bounded tree 
width. Unfortunately, there are examples of classes A that have cores of bounded tree width, but 
for which ECSP(^, — ) has no polynomial delay algorithm unless P = NP (see Example 13.21 ). 

Our main algorithmic results show that ECSP(>4^, — ) has a polynomial delay algorithm if the 
cores of the structures in A have bounded tree width and if, in addition, they can be reached in a 
sequence of "small steps." An endomorphism of a structure is a homomorphism of a structure to 
itself. A retraction is an endomorphism that is the identity mapping on its image. Every structure 

'The other type of restrictions studied in the hterature on CSP are "constraint language restrictions", that is, restrictions 
on the structure imposed by the constraint relations on the values. An example of a constraint language restriction is "all 
clauses of a SAT instance, viewed as a Boolean CSP, are Horn clauses". 
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has a retraction to its core. However, in general, the only way to map a structure to its core may 
be by collapsing the whole structure at once. As an example, consider a path with a loop on both 
endpoints. The core consists of a single vertex with a loop. (More precisely, the two cores are the 
two endpoints with their loops.) The only endomorphism of this structure to a proper substructure 
maps the whole structure to its core. Compare this with a path that only has a loop on one endpoint. 
Again, the core is a single vertex with a loop, but now we can reach the core by a sequence of 
retractions, mapping a path of length n to a subpath of length n — 1 and then to a subpath of length 
n — 2 et cetera. We prove that if ^ is a class of structures whose cores have bounded tree width 
and can be reached by a sequence of retractions each of which only moves a bounded number of 
vertices, then ECSP(^, — ) has a polynomial delay algorithm. 

We also consider more general sequences of retractions or endomorphism from a structure to 
its core. We say that a sequence of endomorphisms from a structure Aq to a substructure Ai C Aq, 
from Ai to a substructure A2, . . . , to a structure A„ has bounded width if A„ and, for each i < n, the 
"difference between Aj and Aj_i" has bounded tree width. We prove that if we are given a sequence 
of endomorphisms of bounded width together with the input structure A, then we can compute all 
solutions by a polynomial delay algorithm. Unfortunately, in general we cannot compute such a 
sequence of endomorphisms efficiently. We prove that even for width 1 it is NP-complete to decide 
whether such a sequence exists. 

Finally, we remark that our results are far from giving a complete classification of the classes A 
for which ECSP(^, — ) has a polynomial delay algorithm and those classes for which it does not. 
Indeed, we show that it will be difficult to obtain such a classification, because such a classification 
would imply a solution to the notoriously open CSP dichotomy conjecture of Feder and Vardi |[T4l 
(see Section[3]for details). 

Due to space restrictions several proofs are omitted. 

2. Preliminaries 

Relational structures. A vocabulary r is a finite set of relation symbols of specified arities. A 
relational structure A over r consists of a finite set A called the universe of A and for each relation 
symbol R £ t, say, of arity r, an r-ary relation C A^. Note that we require vocabularies and 
structures to be finite. A structure A is a substructure of a structure Mif A Q B and R^ C R^ for 
all E T. We write A C B to denote that A is a substructure of B and A C B to denote that A is 
a proper substructure of B, that is, A C B and A / B. A substructure A C B is induced if for all 
R G T, say, of arity r, we have R^ = i?" fl A^. For a subset A C B,we write M[A] to denote the 
induced substructure of B with universe A. 

Homomorphisms. We often abbreviate tuples (oi, . . . , afc) by a. If / is a mapping whose domain 
contains oi, . . . , we write /(a) to abbreviate (/(oi), . . . , f{ak))- A homomorphism from a 
relational structure A to a relational structure B is a mapping ip : A —>■ B such that for all G r 
and all tuples a e R^ we have (/?(a) G R^. A partial homomorphism on C C ^ to B is a 
homomorphism of A[C] to B. It is sometimes useful when designing examples to exclude certain 
homomorphisms or endomorphisms. The simplest way to do that is to use unary relations. For 
example, if i? is a unary relation and (a) € R^ we say that a has color R. Now if b £ B does not 
have color R then no homomorphism from A to B maps a to b. 

Two structures A and B are homomorphically equivalent if there is a homomorphism from A 
to B and also a homomorphism from B to A. Note that if structures A and A' are homomorphically 
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equivalent, then for every structure B there is a homomorphism from A to B if and only if there is a 
homomorphism from A' to B; in other words: the instances (A, B) and (A', B) of the decision CSP 
are equivalent. However, the two instances may have vastly different sizes, and the complexity of 
solving the search and enumeration problems for them can also be quite different. Homomorphic 
equivalence is closely related to the concept of the core of a structure: A structure A is a core if there 
is no homomorphism from A to a proper substructure of A. A core of a structure A is a substructure 
A' C A such that there is a homomorphism from A to A' and A' is a core. Obviously, every core 
of a structure is homomorphically equivalent to the structure. We observe another basic fact about 
cores: 

Observation 2.1. Let A and B be homomorphically equivalent structures, and let A' and B' be cores 
of A and B, respectively. Then A' and B' are isomorphic. In particular, all cores of a structure A are 
isomorphic. Therefore, we often speak of the core of A. 

Observation 2.2. It is easy to see that it is NP-hard to decide, given structures A C B, whether A is 
isomorphic to the core of B. (For an arbitrary graph G, let A be a triangle and B the disjoint union 
of G with A. Then A is a core of B if and only if G is 3-colorable.) Hell and Nesetfil llT9l proved 
that it is co-NP-complete to decide whether a graph is a core. 

Tree decompositions. A tree decomposition of a graph G is a pair (T, S), where T is a tree and B 
is a mapping that associates with every node t G 1^(7") a set Bt C V{G) such that (1) for every 
V G y{G) the set {t G y(J)\v G Bt\ is connected in T, and (2) for every e G E{G) there is a 
t G yiT^ such that e C Bt- The sets Bt, for t G 1^(7"), are called the hags of the decomposition. It 
is sometimes convenient to have the tree T in a tree decomposition rooted; we always assume it is. 
The width of a tree decomposition (T, i?) is max{|i?t| | t G V^(r)} — 1. The tree width of a graph 
G, denoted by tw(G), is the minimum of the widths of all tree decompositions of G. 

We need to transfer some of the notions of graph theory to arbitrary relational structures. The 
Gaifman graph (also known as primal graph) of a relational structure A with vocabulary r is the 
graph G(A) with vertex set A and an edge between a and 6 if a / 6 and there is a relation symbol 
R ^ T, say, of arity r, and a tuple (ai, . . . , a^) G such that a, 6 G {ai, . . . , Or}. We can now 
transfer graph-theoretic notions to relational structures. In particular, a subset B C Ais connected 
in a structure A if it is connected in G(A). A tree decomposition of a structure A can simply be 
defined to be a tree-decomposition of G(A). Equivalently, a tree decomposition of A can be defined 
directly by replacing the second condition in the definition of tree decompositions of graphs by (2') 
for every i? G r and (ai, . . . , a^) G R^ there is a t G y(T) such that {ai, . . . , a^} C Bf. A class C 
of structures has bounded tree width if there is a i(; G N such that tw(A) < w for all A G C. A class 
C of structures has bounded tree width modulo homomorphic equivalence if there is a if G N such 
that every A G C is homomorphically equivalent to a structure of tree width at most w. 

Observation 2.3. A structure A is homomorphically equivalent to a structure of tree width at most 
w if and only if the core of A has tree width at most w. 

The Constraint Satisfaction Problem. For two classes A and B of structures, the Constraint Sat- 
isfaction Problem, CSP(^, B), is the following problem: 

CSP(AS) 

Instance: A G ^, B G i3 

Problem: Decide if there is a homomorphism from A to B. 
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The CSP is a decision problem. The variation of it we study in this paper is the following 
enumeration problem: 

ECSP(A B) 

Instance: A e A,M £ B 

Problem: Output all the homomorphisms from A to B. 

We shall also refer to the search problem, SCSP(^, B), in which the goal is to find one solution 
to a CSP-instance or output 'no' if a solution does not exists. 

If one of the classes A, B is the class of all finite structures, then we denote the correspond- 
ing CSPs by CSP(A-), CSP(-,0) (respectively, ECSP(^, -), ECSP(-,e), SCSP(A-)> 
SCSP(-,0)). 

The decision CSP has been intensely studied. If a class C of structures has bounded arity 
then CSP(C, — ) is solvable in polynomial time if and only if C has bounded tree width modulo 
homomorphic equivalence 117]. If the arity of C is not bounded, several quite general conditions on 
a class of structures have been identified that guarantee polynomial time solvability of CSP(C, — ), 
see, e.g. |[T6l[T2l[T8]| . Problems of the form CSP(— ,C) have been studied mostly in the case when 
C is 1 -element. Problems of this type are sometimes referred to as non-uniform. It is conjectured 
that every non-uniform problem is either solvable in polynomial time or NP-complete (the so-called 
Dichotomy Conjecture) [14]. Although this conjecture is proved in several particular cases 1 20. ,91 
UniUl, in its general form it is believed to be very difficult. 

A search CSP is clearly no easier than the corresponding decision problem. While any non- 
uniform search problem SCSP(— , C) is polynomial time reducible to its decision version CSP(— , C) 
inn , nothing is known about the complexity of search problems SCSP(C, — ) except the result we 
state in Section [3] Paper 1251 provides some initial results on the complexity of non-uniform enu- 
merating problems. 

3. Tractable structures for enumeration 

Since even an easy CSP may have exponentially many solutions, the model of choice for 'easy' 
enumeration problems is algorithms with polynomial delay 11211 . An algorithm Alg is said to solve 
a CSP with polynomial delay (WPD for short) if there is a polynomial p{n) such that, for every 
instance of size n, Alg outputs 'no' in a time bounded by p{n) if there is no solution, otherwise it 
generates all solutions to the instance such that no solution is output twice, the first solution is output 
after at most p{n) steps after the computation starts, and time between outputting two consequent 
solutions does not exceed p{n). 

If a class of relational structures C has bounded arity, the aforementioned result of Grohe ifTTl 
imposes strong restrictions on enumeration problems solvable WPD. 

Observation 3.1. If a class of relational structures C with bounded arity does not have bounded tree 
width modulo homomorphic equivalence, then ECSP(C, — ) is not WPD, unless P=NP. 

Unlike for the decision version, the converse is not true: bounded tree width modulo homomor- 
phic equivalence does not imply enumerability WPD. 

Example 3.2. Let be the disjoint union of a A;-clique and a loop and let A = {A^ \ k > 
1}. Clearly, the core of each graph in A has bounded tree width (in fact, it is a single element), 
hence CSP(^, — ) is polynomial-time solvable. For an arbitrary graph B without loops, let B' 
be the disjoint union of B and a loop. It is clear that there is always a trivial homomorphism 
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from Ak (for any A; > 1) to B' that maps everything into the loop. There exist homomorphisms 
different from the trivial one if and only if B contains a A;-cUque. Thus if we are able to check 
in polynomial time whether there is a second homomorphism, then we are able to test if B has a 
fc-clique. Therefore, although CSP(^, — ) and SCSP(>1, — ) are polynomial-time solvable, a WPD 
enumeration algorithm for ECSP(^, — ) would imply P = NP. 

It is not difficult to show that ECSP(C, — ) is enumerable WPD if C has bounded tree width. 
For space restrictions we do not include a direct proof and instead we derive it from a more general 
result in Section ID Thus enumerability WPD has a different tractability criterion than the decision 
version, and this criterion lies somewhere between bounded tree width and bounded tree width 
modulo homomorphic equivalence. Thus in order to ensure that the solutions can be enumerated 
WPD, we have to make further restrictions on the way the structure can be mapped to its bounded 
tree width core. The main new definition of the paper requires that the core is reached by "small 
steps": 

Let A be a relational structure with universe A. We say that A has a sequence of endomorphisms 
of width k if there are subsets A = Aq D Ai D . . . D An ^ and homomorphisms ipi,. . . ,ipn 
such that 

(1) ipi is a homomorphism from A[A;_i] to A[Aj], 

(2) ipi{Ai.i) = Aiforl<i<n; 

(3) if G is the primal graph of A, then the tree width of G[Ai \ Aj+i] is at most k for every 

<i < n; 

(4) the structure induced by An has tree width at most k. 

In SectionlH we show that enumeration for (A, B) can be done WPD if a sequence of bounded 
width endomorphisms for A is given in the input. Unfortunately, we cannot claim that ECSP(^, — ) 
can be done WPD if every structure in A has such a sequence, since we do not know how to find 
such sequences efficiently. In fact, as we show in Section[5l it is hard to check if a width- 1 sequence 
exists for a given structure. Furthermore, we show a class A where every structure has a width-2 
sequence, but ECSP(^, — ) cannot be done WPD, unless P = NP. This means that it is not possible 
to get around the problem of not being able to find the sequences (for example, by finding sequences 
with somewhat larger width or by constructing the sequence during the enumeration). 

Thus having a bounded width sequence of endomorphisms is not the right tractability crite- 
rion. We then investigate a more restrictive notion, where the bound is not on the tree width of the 
difference of the layers but on the number of elements in the differences. However, in the rest of 
the section, we give evidence that enumeration problems solvable WPD cannot be characterized in 
simple terms relying on tree width. For instance, a description of search problems solvable in poly- 
nomial time would imply a description of non-uniform decision problems solvable in polynomial 
time. This is shown via an analogous result for the search version of the problem, which might be 
of independent interest. By A © B we denote the disjoint union of relational structures A and B. 

Lemma 3.3. Let M be a relational structure, which is a core, and let Cb be {A © B | A ^ B}. Then 
CSP(— , B) is solvable in polynomial time if and only if so is the problem SCSP(Cb, — ). 

Proof. If the decision problem CSP(— , B) is solvable in polynomial time we can construct an algo- 
rithm that given an instance (A, C) of CSP(Cb, — ) computes a solution in polynomial time. Indeed, 
asCSP(— ,B)is solvable in polynomial time by the aforementioned result of fTl j it is also polyno- 
mial time to find a homomorphism from a given structure to B provided one exists. If A G Cb such a 
homomorphism tp exists by the definition of Cb. So our algorithms, first, finds some homomorphism 
if. Then it decides by brute force whether or not there exists a homomorphism (/?' from B to C (note 
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that this can be done in polynomial time for every fixed B). If such a homomorphism does not exist 
then we can certainly guarantee that there is no homomorphism from A to C. Otherwise we obtain 
a required homomorphism ip as follows: Let ip{(^) = ^'{o.) for a £ M, and Tp{a) = cp' o ip(a) for 
a G A. 

Conversely, assume that we have an algorithm Alg that finds a solution of any instance of 
CSP(Cb, — ) in polynomial time, say, p{n). We construct from it an algorithm that solves CSP(— , B). 
Given an instance (A, B) of CSP(-, B) we call algorithm Alg with input A©B and B. Additionally 
we count the number of steps performed by Alg in such a way that we stop if Alg has not finished 
in p{n) steps. If Alg produces a correct answer then we have to be able to obtain from it a homo- 
morphism from A to B. If Alg's answer is not correct or the clock reaches p{n) steps we know that 
Alg failed. The only possible reason for that is that A © B does not belong to Cm, which implies that 
A is not homomorphic to B. ■ 

In what follows we transfer this result to enumeration problems. Let ^ be a class of relational 
structures. The class A' consists of all structures built as follows: Take A £ A and add to it |A| 
independent vertices. 

Lemma 3.4. Let Abe a class of relational structures. Then SCSP(^, — ) is solvable in polynomial 
time if and only //■ECSP(^', — ) is solvable WPD. 

Proof. If ECSP(^, — ) is enumerable WPD, then for any structure A' € A' it takes time polynomial 
in I A' I to find the first solution. Since A' is only twice of the size of the corresponding structure A, 
it takes only polynomial time to solve SCSP(^, — ). 

Conversely, given a structure A' = AU I £ A', where A G A and / is the set of independent 
elements, and any structure B. The first homomorphism from A' to B can be found in polynomial 
time, since SCSP(>4^, — ) is polynomial time solvable and the independent vertices can be mapped 
arbitrarily. Let the restriction of this homomorphism onto A be if. Then while enumerating all 
possible |B|I*I extensions of (f we buy enough time to enumerate all homomorphisms from A to B 
using brute force. ■ 



4. Sequence of bounded width endomorphisms 

In this section we show that for every fixed k, all the homomorphisms from A to B can be 
enumerated with polynomial delay if a sequence of width k endomorphisms of A is given in the 
input. Given a sequence Aq, . . . ,An and . . . , 99„ as in the definition of a sequence of width k 
endomorphisms, we denote A[Ai] by Aj. 

We will enumerate the homomorphisms from A to B by first enumerating the homomorphisms 
from A„, A„_i, ... to B and then transforming them to homomorphisms from A to B using the 
homomorphisms ipi. We obtain the homomorphisms from Aj by extending the homomorphism 
from Aj+i to the set Ai \ Aj+i; Lemma |4~T] below will be useful for this purpose. In order to 
avoid producing a homomorphism multiple times, we need a delicate classification (see definitions 
of elementary homomorphisms and of the index of a homomorphism). 

Lemma 4.1. Let A,Mbe relational structures and Xi C X2 Q A subsets, and let go be a homomor- 
phism from A[Xi\ to B. For every fixed k, there is a polynomial-time algorithm HOMOMORPHISM- 
Ext(A, B, Xi, X2, go) that decides whether go can be extended to a homomorphism from A[X2] to 
B, if the tree width of induced subgraph G[X2 \ Xi] of the Gaifman graph of A is at most k. 
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The index of a homomorphism 99 from A to B is the largest t such that can be written as 
ip = o o . . . o for some homomorphism i/j from At to B. In particular, if cannot be 
written as (p = ip o ipi, then the index of ip is 0. Observe that if the index of ip is at least t, then 
there is a unique tp such that if = ^p o ip^ o . . . o ipi. This follows from the fact that ipt o . . . o ipi 
is a surjective mapping from A to At, thus if tj/ and V'" differ on At, then Tp' o ipt o . . . o ipi and 
Tp" o ipt o . . . o (pi differ on ^. A homomorphism from A^ to B is elementary, if it cannot be 
written as ijj = ip' o tpt+i- A homomorphism is reducible if it is not elementary. 

Lemma 4.2. If a homomorphism ip from At to B is elementary, then ip = ip o cpt o . . . o ipi has 
index exactly t. Conversely, if homomorphism <p> from A to B has index t and can be written as 
(p = ip o ipt o . . . o (pi, then the homomorphism ipfrom At to B is elementary. 

Lemma 14.21 suggests a way of enumerating all the homomorphisms from A to B: for t = 
0, . . . , n, we enumerate all the elementary homomorphisms from A^ to B, and for each such homo- 
morphism ip, we compute ip = ipoipto . . .oipi.To this end, we need the following characterization 
of elementary homomorphisms: 

Lemma 4.3. A homomorphism 'ip from At to B is reducible if and only if 

(1) ^p{x) = tp{y) for every x,y e At with ipt+iix) = (pt+i{y), i.e., for every z G At+i, ip{x) 
has the same value bzfor every x with ipt+i{x) = z, and 

(2) the mapping defined by ip'{z) := is a homomorphism from A^+i to B. 

Lemma 14.31 gives a way of testing in polynomial time whether a given homomorphism ip is 
elementary: we have to test whether one of the two conditions are violated. We state this in a more 
general form: we can test in polynomial time whether a partial mapping qq can be extended to an 
elementary homomorphism ip, if the structure induced by the elements where go is not defined has 
bounded tree width. We fix values every possible way in which the conditions of Lemma 14.31 can 
be violated and use HOMOMORPHISM-EXT to check whether there is an extension compatible with 
this choice. In order to efficiently enumerate all the possible violations of the second condition, the 
following definition is needed: 

Given a relation of arity r, a bad prefix is a tuple (61, . . . , 6^) € with s < r such that 

(1) there is no tuple (61, ... , bs, bs+i, . . . ,bj.) S i?* for any fe^+i, . . . ,br £ B, and 

(2) there is a tuple (61, ... , bs-i,Cs, Cg+i, . . . ,Cr) £ for some ct, ■ . . ,Cr G B. 

If (61, ... , br) R^, then there is a unique 1 < s < r such that the tuple (61, . . . , 6s) is a 
bad prefix: there has to be an s such that (61, ... , 6s) cannot be extended to a tuple of R^, but 
(61, . . . ,6s-i) can. 

Lemma 4.4. The relation R^ has at most \R^\ ■ {\B\ — 1) ■ r bad prefixes, where r is the arity of 
the relation. 

Lemma 4.5. Let X be a subset of At and let be a mapping from X to B. For every fixed k, 
there is a polynomial-time algorithm ELEMENTARY-EXT(t, X, go) that decides whether go can be 
extended to an elementary homomorphism from At to B, if the tree width of the structure induced 
by At — X is at most k. 

We enumerate the elementary homomorphisms in a specific order defined by the following 
precedence relation. Let 99 be an elementary homomorphism from Aj to B and let ip be an elemen- 
tary homomorphism from Aj to B for some j > i. Homomorphism ip is the parent of (/? (99 is a 
child of Ip) if (p restricted to ^i+i can be written as ip o (pj o . . . o ipi+2- Ancestor and descendant 
relations are defined as the reflexive transitive closure of the parent and child relations, respectively. 
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Note that an elementary homomorphism from Aj to B has exactly one parent for i < n and a 
homomorphism from A„ to B has no parent. Fix an arbiti^ary ordering of the elements of A. For 
< i < n and < j < |j4j \ j4j+i|, let Aij be the union of Ai^i and the first j elements of 
Ai \ Ai+i. Note that Ai^ = Ai+i and = ^i- 

Lemma 4.6. Let ijj be a mapping from Aij to B that can be extended to an elementary homomor- 
phism from Aj to B. Assume that a sequence of width k endomorphisms is given for A. For every 
fixed k, there is a polynomial-delay, polynomial-space algorithm ELEMENTARY-ENUM(i, j, tp) that 
enumerates all the elementary homomorphisms of Ki that extends if) and all the descendants of these 
homomorphisms. 

By calling ELEMENTARY-ENUM(n, 0, qq) (where qq is a trivial mapping from to B), we can 
enumerate all the elementary homomorphisms. By the observation in Lemma l42l this means that 
we can enumerate all the homomorphisms from A to B. 

Theorem 4.7. For every fixed k, there is a polynomial-delay, polynomial-space algorithm that, 
given structures A, B, and a sequence of width k endomorphisms of A, enumerates all the homo- 
morphisms from A to B. 

Theorem 14. 7 1 does not provide a complete description of classes of structures solvable WPD. 

Corollary 4.8. There is a class A of relational structures such that not all structures from A have 
a sequence of width k endomorphisms and ECSP(yl, — ) is solvable WPD. 

Proof. Let A be the class of structures that are the disjoint union of a loop and a core. Obviously, 
SCSP(^, — ) is polynomial time solvable. Therefore, by Lemma [l!4l ECSP(^', — ) is solvable with 
polynomial delay. However, it is not hard to see that A' does not have a sequence of endomorphisms 
of bounded tree width. ■ 

Furthermore, as we will see in the next section it is hard, in general, to find a sequence of 
bounded width endomorphims. Still, we can find a sequence of endomorphisms for a structure A if 
we impose two more restrictions on such a sequence. 

A retraction of a structure A is called a k-retraction if at most k nodes change their value 
according to ip. A structure is a k-core if the only A;-retraction is the identity. A fc-core of a structure 
is any fc-core obtained by a sequence of A; -retractions. 

Lemma 4.9. All k-cores of a structure A are isomorphic. 

Lemma 14.91 amounts to say that when searching for a sequence of fe-retractions converging to 
a A:-core we can use the greedy approach and include, as the next member of such a sequence, any 
A;-retraction with required properties. With this in hands we now can apply Theorem l4.7l 

Theorem 4.10. Let k > be a positive integer and let C be a class of structures such that the k- 
core of every structure in C has tree width at most k. Then, the enumeration problem ECSP(C, — ) 
is solvable WPD. 

Corollary 4.11. If C is a class of structures of bounded tree width then ECSP(C, — ) is solvable 
WPD. 
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5. Hardness results 

The first result of this section shows that finding a sequence of endomorphisms of bounded 
width can be difficult even in simplest cases. 

Theorem 5.1. It is NP-complete to decide if a structure has a sequence of 1 -width retractions to 
the core. 

The second result shows that ECSP(^, — ) can be hard even if every structure in A has a se- 
quence of width-2 endomorphisms. Note that this result is incomparable with Theorem 15. 1[ since 
an enumeration algorithm (in theory) does not necessarily have to compute an sequence of endo- 
morphisms. We need the following lemma: 

Lemma 5.2. If G is a planar graph, then it is possible to find a partition (Vi, V2) of its vertices in 
polynomial time such that G\yi\ and G\V2\ have tree width at most 2. 

Proposition 5.3. There is a class A of relational structures such that every structure from A has 
a sequence of width 2 endomorphisms to the core, and such that the problem ECSP(^, — ) is not 
solvable WPD, unless P = NP. 

Proof. Let ^ be a class of graphs built in the following way. Take a 3-colorable planar graph G 
and its partition (Vi, V2) according to Lemma [S!2l Using colorings we can ensure that G is a core. 
Then we take a disjoint union of this graph with a triangle T having all the colors and a copy Gi of 
G[Vi]. Let A denote the resulting structure. 

Claim 1. A has a sequence of width-2 endomorphisms. 

Let ip h& n 3-coloring of G that is a homomorphism into the triangle, and ij)' the bijective 
mapping from Gi to Then Lpi is defined to act as if) on G, as if)' on G'l and identically on 

T. Endomorphism (/72 is just the 3-coloring of G U Gi induced by if:. The images of and ip2 are 
T U and T, respectively, so all the conditions on a sequence of width-2 homomorphisms are 

easily checkable. 

Claim 2. The Planar graph 3-coloring problem is Turing reducible to ECSP(^, -). 

Given a planar graph G we find its partition (Vi, V2) and create a structure A, as described 
above. Then we apply an algorithm that enumerates solutions to ECSP(^, — ) We may assume that 
such an algorithm stops with some time bound regardless whether G is 3-colorable or not. If the 
algorithm succeeds we can now produce a 3-coloring of G. ■ 

6. Conjunctive queries 

When making a query to a database one usually needs to obtain values of only those variables 
(attributes) (s)he is interested in. In terms of homomorphisms this can be translated as follows: For 
relational structures A, B, and a subset y C ^4, we aim to list those mappings from y to B which 
can be extended to a full homomorphism from A to B. In other words, we would like to enumerate 
all the mappings from y to S that arise as the restriction of some homomorphism from A to B. 
Clearly, this problem significantly differs from the regular enumeration problem. A mapping from 
y to S can be extendible to a homomorphism in many ways, possibly superpolynomially many, 
and an enumeration algorithm would list all of them. In the worst case scenario it would list them 
before turning to the next partial mapping. If this happens it may destroy polynomiality of the delay 
between outputting consecutive solutions. 

In this section we treat the CONJUNCTIVE Query Evaluation problem as follows. 
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CQE(^, B) 

Instance: A e A,M £ B,Y <^ A 

Problem: Output all partial mappings from Y io B ex- 
tendible to a homomorphism from A to B. 

We present two results, first one of them shows that the problem CQE(^, — ) is WPD when A 
is a class of structures of bounded tree width, the second one claims that, modulo some complexity 
assumptions, in contrast to enumeration problems this cannot be generalized to structures with k- 
cores of bounded tree width for > 2. 

Theorem 6.1. If A is a class of structures of bounded width then CQE(^, — ) is solvable WPD. 

Proof We use Lemma |4~T] to show that algorithm CQE-BOUNDED-WIDTH of Figure [U does the 
job. Indeed, this algorithms backtracks only if outputs a solution. ■ 

Theorem l6. ll does not generalize to classes of structures whose fc-cores have bounded width. 

Example 6.2. Recall that the MULTICOLORED Clique problem (cf. |15|) is formulated as fol- 
lows: Given a number k and a vertex /c-colored graph, decide if the graph contains a /c-clique all 
vertices of which are colored different colors. This problem is -complete, i.e., has no time 
f{k)n^ algorithm for any function / and constant c, unless FPT= W[\]. We reduce this problem to 
CQE(^, — ) where A is the class of structures whose 2-cores are 2-element described below. 

Let us consider relational structures with two binary and two unary relations. This structure 
can be thought of as a graph whose vertices and edges have one of the two colors, say, red and 
blue, accordingly to which of the two binary/unary relations they belong to. Let A^, be the relational 
structure with universe {ai, . . . , a^, yi, . . . , yk], where oi, . . . , are red while yi, . . . , are blue. 
Then {oi, . . . , a^} induces a red clique, that is every Oj, Oj (i, j are not necessarily different) are 
connected with a red edge, and each yi is connected to Oj with a blue edge. It is not hard to see that 
every pair of a red and blue vertices induces a 2-core of this structure. Set A = {A^ | k G N}. 

The reduction of the MULTICOLORED Clique problem to CQE(^, -) goes as follows. Given 
a A;-colored graph G = {V, E) whose coloring induces a partition of V into classes Bi, . . . , B^. 
Then we define structures A, IB and a set y C A. We set A = A^, Y = {yi, . . . , y^}. Then let 
B = V U {bi, . . . , bk}, the elements of V are colored red and the induced substructure M[V] is the 



Figure 1: Algorithm CQE-BOUNDED-WIDTH 
Input: Relational structures A, B, and Y = {Yi, . . . , Y^} C A 
Output: A list of mappings tp: Y ^ B extendible to a homomorphism from A to B 
Step 1 set m = 0, if = (/), Si ~ B, i e [m], complete false 
Step 2 while not complete do 
Step 2.1 if TO < £ then do 

Step 2.1.1 search Sm+i until a 6 e 5^+1 is found such that there exists a homomorphism extending 

tf U {vm+i b] and remove all members of Sm+i preceding b inclusive 
Step 2.1.2 if such a b exists then set ip ipiJ {y-m+i 6}, to ;= to + 1 
Step 2.1.3 else 

Step 2.1.3.1 if m 7^ then set ip = V\{yi,....y„^-i} ^"d Sm+i '■— B,m :— m — \ 
Step 2.1.3.2 else set complete:=true 
Step 2.2 else then do 
Step 2.2.1 output If 

Step 2.2.2 set p :— P'\{y;^ t/m-i}}' m := £ — 1 

endwhile 



242 



A.BULATOV, V.DALMAU, M.GROHE, AND D.MARX 



graph G (without coloring) whose edges are colored also red. Finally, 61 , . . . , 6^ are made blue and 
each hi is connected with a blue edge with every vertex from Bi. 

It is not hard to see that any homomorphism maps {ai, . . . , a^} to V and Y to {hi, . . . , b^}, 
and that the number of homomorphisms that do not agree on Y does not exceed hf'. Moreover, G 
contains a fe-colored clique if and only if there is a homomorphism from A to B that maps Y onto 
{61, . . . , If there existed an algorithm solving CQE(^, — ) WPD, say, time needed to compute 
the first and every consequent solution is bounded by a polynomial p{n), then time needed to list all 
solutions is at most k^p{n). This means that MULTICOLORED Clique is FPT, a contradiction. 
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